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Abstract 

Top quark production in the forward region in proton-proton collisions is observed 
for the first time. The W+b final state with IT —)• is reconstructed using muons 
with a transverse momentum, p^', larger than 25GeV in the pseudorapidity range 
2.0 < ?7 < 4.5. The b jets are required to have 50 < px < 100 GeV and 2.2 < p < 4.2, 
while the transverse component of the sum of the muon and 6-jet momenta must 
satisfy px > 20 GeV. The results are based on data corresponding to integrated 
luminosities of 1.0 and 2.0 fb~^ collected at center-of-mass energies of 7 and 8TeV by 
LHCb. The inclusive top quark production cross-sections in the fiducial region are 

(T(top)[7TeV] = 239 ± 53 (stat) ± 33 (syst) ± 24 (theory) fb , 
cr(top)[8 TeV] = 289 ± 43 (stat) ± 40 (syst) ± 29 (theory) fb . 

These results, along with the observed differential yields and charge asymmetries, 
are in agreement with next-to-leading order Standard Model predictions. 
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The production of top quarks (t) from proton-proton (pp) collisions in the forward 
region is of considerable experimental and theoretical interest. In the Standard Model 
(SM), four processes make signihcant contributions to top quark production: ti pair 
production; single top production via processes mediated by a kb boson in the t-channel 
{qb —)■ q't) or in the s-channel {qq' —)■ tb)] and single top produced in association with a W 
boson {gb —?• tW). The initial-state b quarks arise from gluon splitting to bh pairs or from 
the intrinsic b quark content in the proton. Top quarks decay almost entirely via t —)• Wb. 
The SM predicts that about 75% of t —)■ Wb decays in the forward region are due to tt 
pair production. The remaining 25% are mostly due to t-channel single-top production, 
with s-channel and associated single-top production making percent-level contributions. 

The enhancement at forward rapidities of ti production via qq and qg scattering, 
relative to gg fusion, can result in larger charge asymmetries, which may be sensitive 
to physics beyond the SM [^[^. Forward ti events can be used to constrain the gluon 
parton distribution function (PDF) at large momentum fraction, resulting in reduced 
theoretical uncertainty for many SM predictions |^. Furthermore, both single-top and 
ti cross-section measurements in the forward region will provide important experimental 
tests of differential next-to-next-to-leading order theoretical calculations as they become 
available [^. 

This Letter reports the hrst observation of top quark production in the forward region. 
The data used correspond to integrated luminosities of 1.0 and 2.0 fb~^ collected at center-of- 
mass energies of ^/s = 7 and 8 TeV in pp collisions with the LHCb detector. The W bosons 
are reconstructed using the W ^ pu decay with muons having a transverse momentum, 
Pt, larger than 25 GeV (c = 1 throughout this Letter) in the pseudorapidity range 
2.0 < T] < 4.5. The analysis is performed using jets clustered with the anti-^T algorithm 
using a distance parameter R = 0.5. The jets are required to have 50 < pt < 100 GeV 
and 2.2 < p < 4.2. The muon and jet (j) must be separated by AR{p,j) > 0.5, with 
AR = Ap2 -|- A02. Here Ap (A0) is the difference in pseudorapidity (azimuthal angle) 
between the muon and jet momenta. The transverse component of the sum of the muon 
and jet momenta must satisfy Pt(p + j) = (p(f) + p(j))t > ^0 GeV. 

The LHGb detector is a single-arm forward spectrometer covering the pseudorapidity 
range 2 < p < 5, designed for the study of particles containing b or c quarks. It is 
described in detail in Refs. ||^[^ . The trigger consists of a hardware stage, based on 
information from the calorimeter and muon systems, followed by a software stage, which 
applies a full event reconstruction. This analysis requires at least one muon candidate that 
satishes the trigger requirement of pt > 10 GeV. Global event cuts (GECs), which prevent 
high-occupancy events from dominating the processing time of the software trigger, have 
an efficiency of about 90% for kF-|-jet and top quark events. 

Simulated pp collisions are generated using Pythia [9 with an LHGb conhguration 10 . 
Decays of hadronic particles are described by EvtGen ^ in which final-state radiation 
is generated using Photos [^. The interaction of the generated particles with the 
detector, and its response, are implemented using the Geant4 toolkit 13 as described in 
Ref. 14 . Further theory calculations are performed at next-to-leading order (NLO) with 
and the GTIO PDF set 


MGFM 15 


16 , and are cross-checked using PowhegBox 17 
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with hadronization simulated by Pythia. The theoretical uncertainty on the cross-section 
predictions is a combination of PDF, scale, and strong-coupling (a^) uncertainties. The 
PDF and scale uncertainties are evaluated following Refs. and [^, respectively. The 
as uncertainty is evaluated as the envelope obtained using as{Mz) G [0.117,0.118,0.119] 
in the theory calculations. 

The event selection is the same as that in Ref. 19 but a reduced hducial region is used 
to enhance the top quark contribution relative to direct W+b production. The signature 
for IF-l-jet events is an isolated high-px muon and a well-separated jet originating from the 
same pp interaction. Signal events are selected by requiring a high-px muon candidate and 
at least one jet with Ai?(p, j) > 0.5. For each event, the highest-px muon candidate that 
satishes the trigger requirements is selected, along with the highest-px jet from the same 
pp collision. The primary background to top quark production is direct W+b production; 
however, Z+b events, with one muon undetected in the decay Z —)■ p/i, and di-6-jet events 
also contribute to the p-|-6-jet final state. 

The anti-fex clustering algorithm is used as implemented in Fast Jet 


20 . Information 


from all the detector subsystems is used to create charged and neutral particle inputs to the 
jet-clustering algorithm using a particle flow approach (^. The reconstructed jets must 
fall within the pseudorapidity range 2.2 < p(j) < 4.2. The reduced p(j) acceptance ensures 
nearly uniform jet reconstruction and heavy-flavor tagging efficiencies. The momentum of 
a reconstructed jet is corrected to obtain an unbiased estimate of the true jet momentum. 
The correction factor, typically between 0.9 and 1.1, is determined from simulation and 
depends on the jet px and rj, the fraction of the jet px measured with the tracking system, 
and the number of pp interactions in the event. 

The high-px muon candidate is not removed from the anti-fcx inputs and so is clustered 
into a jet. This jet, referred to as the muon jet and denoted as j^, is used to discriminate 
between kF-l-jet and dijet events [^. No correction is applied to the momentum of the 
muon jet. The requirement PtU/j, + j) > 20GeV is made to suppress dijet backgrounds, 
which are well balanced in px, unlike lF-|-jet events, where there is undetected energy 
from the neutrino. Events with a second, oppositely charged, high-px muon candidate 
from the same pp collision are vetoed. However, when the dimuon invariant mass is in 
the range 60 < < 120 GeV, such events are selected as Z(p/i)-|-jet candidates, 

which are used to determine the Z+jet background. 

The jets are identihed (tagged) as originating from the hadronization of a 6 or c quark 
by the presence of a secondary vertex (SV) with AR < 0.5 between the jet axis and the SV 
direction of flight, defined by the vector from the pp interaction point to the SV position. 
Two boosted decision trees (BDTs) 1^,23 , trained on the characteristics of the SV and 


the jet, are used to separate heavy-flavor jets from light-parton jets, and to separate b jets 
from c jets. The two-dimensional distribution of the BDT responses observed in data is 
htted to obtain the SV-tagged b, c and light-parton jet yields. The SV-tagger algorithm is 
described in Ref. 24 , where the heavy-flavor tagging efficiencies and light-parton mistag 
probabilities are measured in data. The data samples used in Ref. are too small 


to validate the performance of the SV-tagger algorithm in the px(j) > 100 GeV region. 
Furthermore, the mistag probability of light-parton jets increases with jet px. Therefore, 
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Figure 1: Distribution of P'T{p)/p'T{jfjL) with fit overlaid for all VF+jet candidates. 


only jets with px < 100 GeV are considered in the hducial region, which according to 
simulation retains about 80% of all top quark events. 

Inclusive FF+jet production, i.e. where no SV-tag requirement is made on the jet, is 
only contaminated at the percent level by processes other than direct IF+jet production. 
Therefore, FF+jet production is used to validate both the theory predictions and the 
modeling of the detector response. Furthermore, the SM prediction for a{Wh)/a{Wj) has 
a smaller relative uncertainty than a{Wb) alone, since the theory uncertainties partially 
cancel in the ratio. The analysis strategy is to hrst measure the FF+jet yields, and then 
to obtain predictions for the yields of direct W + b production using the prediction for 
a{Wb)/a(Wj). To an excellent approximation, many experimental effects, e.g. the muon 
reconstruction efficiency, are expected to be the same for both samples and do not need to 
be considered in the direct FF+6 yield prediction. 

The FF+jet yield is determined by performing a ht to the /P^Uti) distribution 

with templates, histograms obtained from data, as described in Ref. (^. The Z+jet 
contribution is fixed from the fully reconstructed Z(/i/i)+jet yield, where the probability 
for one of the muons to escape detection is obtained using simulation. The contributions 
of b, c, and light-parton jets are each free to vary in the fit. Figure shows the fit for all 
candidates in the data sample. Such a ht is performed for each muon charge separately 
in bins of pt(f + j); the diherential FF+jet yield and charge asymmetry, dehned as 
[cr(FF+j) — cr(FF“j)]/[cr(FF+j) + (j(FF“j)], are given in Fig. 

To compare the data to theory predictions, the detector response must be taken into 
account. All signihcant aspects of the detector response are determined using data-driven 
techniques. The muon trigger, reconstruction, and selection efficiencies are determined 
using Z —)■ /i/i events 21,25 . The GEC efficiency is obtained following Ref. 


21 


an 


alternative dimuon trigger requirement, which requires a looser GEC, is used to determine 
the fraction of events that are rejected. Contamination from FF —)■ r —/i decays is 


estimated to be 2.5% using both simulated FF+jet events and inclusive FF data samples 26 


The fraction of muons that migrate out of the hducial region due to hnal-state radiation is 
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Figure 2: Results for the inclusive VF+jet yield (left) and charge asymmetry (right) versus 
Pt(/4 + j) compared to SM predictions at NLO obtained using MCFM. The data error bars are 
smaller than the marker size, the SM uncertainties are highly correlated across PT(h + j) bins. 


about 1.5% (^ . 

Migration of events in jet pt due to the detector response is studied with a data sample 
enriched in b jets using SV tagging. The PT(SV)/pT(j) distribution observed in data is 
compared to templates obtained from simulation in bins of jet px- The resolution and 
scale for each jet px bin are varied in simulation to find the best description of the data 
and to construct a detector response matrix. Figure shows that the SM predictions, 
obtained with all detector response effects applied, agree with the inclusive bF+jet data. 

The yields of W+c and W+b, which includes t —)■ Wb decays, are determined using 
the subset of candidates with an SV-tagged jet and binned according to Pt:{ij)/ptU^i)- 
In each Pt{iA !VtU^ l) din, the two-dimensional SV-tagger BDT-response distributions 
are htted to determine the yields of c-tagged and ^-tagged jets, which are used to form 
the p^ip)/P tU^ l) distributions for candidates with c-tagged and 6-tagged jets. These 
p^{p)/P tU^ l) distributions are htted to determine the SV-tagged W+c and W+b yields. 

A ht to the PT(h)/TT(j/i) distribution built from the c-tagged jets from the full data 
sample is provided as supplemental material to this Letter [^. Figure]^ shows that the 
VF-l-c yield versus px(p + c) agrees with the SM prediction. Since the IT-t-c hnal state 
does not have any signihcant contributions from diboson or top quark production in the 
SM, this comparison validates the analysis procedures. 

Figure shows a ht to the PT(p)/PT(j/x) distribution built from the ^-tagged jets 
from the full data sample. For Px(h)/PT(j/i) > 0.9 the data are dominantly from W 
decays. Figure shows the yield and charge asymmetry distributions obtained as a 
function of px(p + b). The direct W+b prediction is determined by scaling the inclusive 
VF+jet distribution observed in data by the SM prediction for a{Wb)/a{Wj) and by the 
5-tagging efficiency measured in data [^. As can be seen, the data cannot be described 
by the expected direct W+b contribution alone. The observed yield is about three times 
larger than the SM prediction without a top quark contribution, while the SM prediction 
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Figure 3: Results for W+c compared to SM predictions at NLO obtained using MCFM. 
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Fignre 4: Distribution of PTip)/PTUfj.) with fit overlaid for all W+b candidates. 


including both tt and single-top production does describe the data well. 
In Ref. 


19 , W+b is studied in a larger fiducial region (pt(r) > 20 GeV, PtO) > 20 GeV), 


where the top quark contribution is expected to be about half as large as that of direct W+b 
production. The ratio [a(Wb)+a{iop)\/a{Wj) is measured in the larger fiducial region to 
be 1.17 ± 0.13 (stat) ± 0.18 (syst)% at y/s = 7TeV and 1.29 ± 0.08 (stat) ± 0.19 (syst)% at 
y/s = 8 TeV. These results agree with SM predictions, that include top quark production, 
of 1.23 ± 0.24% and 1.38 ± 0.26%, respectively. This validates the direct W+b prediction, 
since direct W + b production is the dominant contribution to the larger fiducial region. 

Various sources of systematic uncertainties are considered and summarized in Table 
The direct W+b prediction is normalized using the observed inclusive lT-|-jet data yields. 
Therefore, most experimental systematic uncertainties cancel to a good approximation. 

Since the muon kinematic distributions in IT-l-jet and W+b are similar, all muon-based 
uncertainties are negligible with the exception of the trigger GEC efficiency. The data- 
driven GEC study discussed above shows that the efficiencies are consistent for PE-l-jet and 
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Figure 5: Results for the W + b yield (left) and charge asymmetry (right) versus pt(/4 + b) 
compared to SM predictions obtained at NLO using MCFM. 


W+b, with the statistical precision of this study assigned as the systematic uncertainty. 
Mismodeling of the PT{+)/pT{jii) distributions largely cancels, since this shifts the inclusive 
VF+jet and W+b hnal-state yields by the same amount, leaving the observed excess over 
the expected direct W+b yield unaffected. The one exception is possible mismodeling of 
the dijet templates, since the flavor content of the dijet background is not the same in the 
two samples. Variations of these templates are considered and a relative uncertainty of 5% 
is assigned on the W boson yields. 

The jet reconstruction efficiencies for heavy-flavor and light-parton jets in simulation 
are found to be consistent within 2%, which is assigned as the systematic uncertainty for 
flavor-dependencies in the jet-reconstruction efficiency. The SV-tagger BDT templates used 
in this analysis are two-dimensional histograms obtained from the data samples enriched 
in b and c jets used in Ref. [^. Following Refs. a 5% uncertainty on the ^-tagged 

yields is assigned due to uncertainty in these templates. The precision of the 6-tagging 
efficiency measurement (10%) in data 


24 is assigned as an additional uncertainty. 


To determine the statistical signihcance of the top quark contribution, a binned profile 
likelihood test is performed. The top quark distribution and charge asymmetry versus 
Pt{+ + 6) are obtained from the SM predictions. The total top quark yield is allowed 
to vary freely. Systematic uncertainties, both theoretical and experimental, are handled 
as Gaussian constraints. The prohle likelihood technique is used to compare the SM 
hypotheses with and without a top quark contribution. The signihcance obtained using 
Wilks theorem is 5.da, conhrming the observation of top quark production in the 
forward region. 

The yield and charge asymmetry distributions versus PT(h + 6) observed at y/s = 7 
and 8 TeV are each consistent with the SM predictions. The excess of the observed yield 
relative to the direct W+b prediction at each y/s is attributed to top quark production, 
and used to measure the cross-sections. Some additional systematic uncertainties that 
apply to the cross-section measurements do not factor into the signihcance determination. 
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Table 1: Relative systematic uncertainties. The symbol f denotes an uncertainty that only 
applies to the cross-section measurement and not the significance determination. Only the 
luminosity uncertainty depends on ^/s■. 2% at 7 TeV and 1% at 8 TeV. 


source 

uncertainty 

GEG 

2% 

Pt{p)/PT{ j^i) templates 

5% 

jet reconstruction 

2% 

SV-tag BDT templates 

5% 

6-tag efficiency 

10% 

trigger & p selection 

2%t 

jet energy 

5%t 

W ^ T ^ p 

l%t 

luminosity 

l-2%t 

Total 

14% 

Theory 

10% 


The uncertainties due to the muon trigger, reconstruction, and selection efficiencies are 
taken from the data-driven studies of Refs. 21,25. The uncertainty due to the jet energy 


determination is obtained from the data-driven study used to obtain the detector response 
matrix. The uncertainty due to IT —>■ r —>■ /i contamination is taken as the difference 
between the contamination in simulation versus that of a data-driven study of inclusive 
IT —)■ /iz/ production |26 . The luminosity uncertainty is described in detail in Ref. 


29 


The total systematic uncertainty is obtained by adding the individual contributions in 
quadrature. 

The resulting inclusive top production cross-sections in the fiducial region defined by 
Pt(r) > 25GeV, 2.0 < < 4.5, 50 < pT{b) < 100 GeV, 2.2 < 7]{b) < 4.2, AR(/i, 6) > 

0.5, and + 6) > 20 GeV, are 


cr(top) [7 TeV] = 239 ± 53 (stat) ± 33 (syst) ± 24 (theory) fb , 
cr(top) [8 TeV] = 289 ± 43 (stat) ± 40 (syst) ± 29 (theory) fb . 


The systematic uncertainties are nearly 100% correlated between the two measurements. 

In summary, top quark production is observed for the first time in the forward region. 
The cross-section results are in agreement with the SM predictions of 180jl|} (312^^1) fb at 
7(8) TeV obtained at NLO using MCFM. The differential distributions of the yield and 
charge asymmetry are also consistent with SM predictions. 
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